Picture for J. Zico Kolter

J. Zico Kolter

Carnegie Mellon University

ROGUE: Misaligned Agent Behavior Arising from Ordinary Computer Use

Add code
May 29, 2026
Viaarxiv icon

Understanding and Mitigating Premature Confidence for Better LLM Reasoning

Add code
May 23, 2026
Viaarxiv icon

Base Models Look Human To AI Detectors

Add code
May 19, 2026
Viaarxiv icon

The Finetuner's Fallacy: When to Pretrain with Your Finetuning Data

Add code
Mar 17, 2026
Viaarxiv icon

Mamba-3: Improved Sequence Modeling using State Space Principles

Add code
Mar 16, 2026
Viaarxiv icon

Mimetic Initialization of MLPs

Add code
Feb 06, 2026
Viaarxiv icon

Antidistillation Fingerprinting

Add code
Feb 03, 2026
Viaarxiv icon

When Should We Introduce Safety Interventions During Pretraining?

Add code
Jan 11, 2026
Viaarxiv icon

From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence

Add code
Jan 06, 2026
Viaarxiv icon

Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing

Add code
Dec 10, 2025
Figure 1 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Figure 2 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Figure 3 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Figure 4 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Viaarxiv icon